Reinforcement learning - PDFSEARCH.IO - Document Search Engine

Reinforcement learning
Results: 1147

#	Item
181	From Ads to Interventions: Contextual Bandits in Mobile Health Ambuj Tewari and Susan A. Murphy Abstract The first paper on contextual bandits was written by Michael Woodroofe inbut the term “contextual bandi Source URL: dept.stat.lsa.umich.edu Language: English - Date: 2016-06-30 18:36:20 Machine learning Multi-armed bandit Stochastic optimization Reinforcement learning Pi Algorithm
182	Inverse Reinforcement Learning for Interactive Systems∗ [Extended Abstract] Olivier Pietquin SUPELEC - UMIGeorgiaTech-CNRS) 2 rue Edouard BelinMetz - France Source URL: www.ilhaire.eu Language: English - Date: 2013-10-03 05:33:46 Machine learning Computational linguistics User interface techniques Multimodal interaction User interfaces Reinforcement learning Apprenticeship learning Computational learning theory Speech recognition Intelligent agent Dialog system Dialog manager
183	On Stochastic Feedback Control for Multi-antenna Beamforming: Formulation and Low-Complexity Algorithms Sun Sun, Min Dong, and Ben Liang Source URL: www.comm.utoronto.ca Language: English - Date: 2014-05-05 14:44:36 Markov processes Markov models Mathematical optimization Stochastic control Dynamic programming Markov decision process Beamforming Reinforcement learning Optimal control Markov chain Q-learning Control theory
184	Boosted Bellman Residual Minimization Handling Expert Demonstrations Bilal Piot, Matthieu Geist, Olivier Pietquin Source URL: www.metz.supelec.fr Language: English - Date: 2014-07-15 03:12:51 Operations research Machine learning Belief revision Reinforcement learning Dynamic programming Markov decision process Mathematical optimization Supervised learning
185	Neural Dynamics and Reinforcement Learning Presented By: Matthew Luciw DFT SUMMER SCHOOL, 2013 Source URL: roboticsschool.ini.rub.de Language: English - Date: 2013-10-09 09:31:30 Machine learning Artificial intelligence Computational neuroscience Robotics Artificial neural networks Reinforcement learning Metalearning Dalle Molle Institute for Artificial Intelligence Research Q-learning Robot learning Reinforcement Cognitive robotics
186	Policy Gradient Coagent Networks Philip S. Thomas Department of Computer Science University of Massachusetts Amherst Amherst, MA 01002 Source URL: psthomas.com Language: English - Date: 2013-11-16 15:49:43 Estimation theory Reinforcement learning Fisher information Likelihood function
187	Language Understanding for Text-based Games using Deep Reinforcement Learning Karthik Narasimhan∗ CSAIL, MIT Source URL: www.emnlp2015.org Language: English - Date: 2015-09-04 01:25:56 Artificial neural networks Computational neuroscience Cybernetics Belief revision Reinforcement learning Recurrent neural network Q-learning Long short-term memory DQN Markov decision process Sepp Hochreiter Artificial intelligence
188	Data-Efficient Off-Policy Policy Evaluation for Reinforcement Learning Philip S. Thomas Dhruva Tirumala Emma Brunskill Carnegie Mellon University Source URL: psthomas.com Language: English - Date: 2016-06-06 11:57:32 Estimation theory Statistical theory Statistical inference Robust statistics Least squares Estimator Bias of an estimator Efficiency Mean squared error L-estimator Consistent estimator Statistics
189	General Reinforcement Learning Jan Leike Future of Humanity Institute University of Oxford 9 June 2016 Source URL: intelligence.org Language: English - Date: 2016-06-10 12:39:29 Machine learning Behaviorism Belief revision Reinforcement learning AIXI Reinforcement Ergodicity Learning
190	End-to-end Learning of Action Detection from Frame Glimpses in Videos Serena Yeung1, Olga Russakovsky1,2, Greg Mori3, Li Fei-Fei1 1 Stanford University, 2 Carnegie Mellon University, 3 Simon Fraser University Source URL: vision.stanford.edu Language: English - Date: 2016-04-13 14:34:27 Belief revision Conference on Computer Vision and Pattern Recognition Artificial neural network Activity recognition Object detection Reinforcement learning Image segmentation
